PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Do000294.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Dichantheliinae; Dichanthelium
Family GATA
Protein Properties Length: 887aa    MW: 96795.8 Da    PI: 9.5506
Description GATA family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
Do000294.1     genome  Dichan  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GATA    46.4   5.2e-15  157    188  1          32
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrk 32 
                 C +Cg+ +Tp+WR+gp g+ tLCnaCG++ + 
  Do000294.1 157 CLQCGAAETPQWRSGPMGQGTLCNACGVRLKA 188
                 89**************************9986 PP

2    GATA    50.3   3.2e-16  294    328  1          35
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C +Cg+++Tp+WR+gp g +tLCnaCG++yr  +l
  Do000294.1 294 CLHCGSSSTPQWREGPMGRSTLCNACGVRYRQGRL 328
                 99*****************************9886 PP

3    GATA    54.6   1.5e-17  517    551  1          35
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C++C +++Tp+WR+gp g++tLCnaCG++y+  +l
  Do000294.1 517 CVHCASSSTPQWREGPKGPSTLCNACGVRYKQGRL 551
                 *******************************9876 PP

4    GATA    52.3   7.5e-17  755    789  1          35
        GATA   1 CsnCgttkTplWRrgpdgnktLCnaCGlyyrkkgl 35 
                 C+ C++t+Tp+WR gp+gn +LCnaCGl+ rk g+
  Do000294.1 755 CVDCRATETPQWRAGPEGNHKLCNACGLRRRKAGE 789
                 ********************************987 PP
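
The four hits above can be checked for the cysteine spacing usually described for plant GATA-type zinc fingers (C-x2-C-x18-C-x2-C). The sketch below is illustrative only: the regular expression is a simplified spacing motif chosen for this example, not the PROSITE PS00344 pattern and not the HMM used by the database, and the names `HITS`, `ZINC_FINGER` and `finger_spans` are hypothetical.

```python
import re

# Aligned query segments keyed by their 1-based start position,
# copied from the four alignments above.
HITS = {
    157: "CLQCGAAETPQWRSGPMGQGTLCNACGVRLKA",
    294: "CLHCGSSSTPQWREGPMGRSTLCNACGVRYRQGRL",
    517: "CVHCASSSTPQWREGPKGPSTLCNACGVRYKQGRL",
    755: "CVDCRATETPQWRAGPEGNHKLCNACGLRRRKAGE",
}

# Simplified spacing motif (an assumption for illustration only).
ZINC_FINGER = re.compile(r"C.{2}C.{18}C.{2}C")


def finger_spans(hits):
    """Return the 1-based inclusive (start, end) of the motif per hit, or None."""
    spans = {}
    for start, segment in hits.items():
        match = ZINC_FINGER.search(segment)
        if match is None:
            spans[start] = None
        else:
            spans[start] = (start + match.start(), start + match.end() - 1)
    return spans


for hit_start, span in finger_spans(HITS).items():
    print(hit_start, span)
```

For the segments as printed, each span begins at the first residue of its hit and covers 26 residues, which coincides with the PROSITE pattern rows listed under Protein Features.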

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SuperFamily      SSF57716          3.56E-12  151    197  No hit       No description
SMART            SM00401           3.6E-13   151    201  IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           13.508    151    205  IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10  2.2E-13   155    194  IPR013088    Zinc finger, NHR/GATA-type
CDD              cd00202           7.05E-10  156    199  No hit       No description
PROSITE pattern  PS00344           0         157    182  IPR000679    Zinc finger, GATA-type
Pfam             PF00320           1.5E-12   157    188  IPR000679    Zinc finger, GATA-type
SMART            SM00401           1.8E-15   288    342  IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           12.651    288    324  IPR000679    Zinc finger, GATA-type
SuperFamily      SSF57716          5.23E-13  289    347  No hit       No description
Gene3D           G3DSA:3.30.50.10  1.7E-13   292    326  IPR013088    Zinc finger, NHR/GATA-type
CDD              cd00202           9.21E-13  293    349  No hit       No description
PROSITE pattern  PS00344           0         294    319  IPR000679    Zinc finger, GATA-type
Pfam             PF00320           5.1E-14   294    328  IPR000679    Zinc finger, GATA-type
SMART            SM00401           1.0E-14   511    561  IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           12.611    511    547  IPR000679    Zinc finger, GATA-type
Gene3D           G3DSA:3.30.50.10  4.0E-14   515    549  IPR013088    Zinc finger, NHR/GATA-type
SuperFamily      SSF57716          1.43E-12  516    573  No hit       No description
CDD              cd00202           1.20E-12  516    563  No hit       No description
PROSITE pattern  PS00344           0         517    542  IPR000679    Zinc finger, GATA-type
Pfam             PF00320           2.4E-15   517    551  IPR000679    Zinc finger, GATA-type
SMART            SM00401           8.9E-13   749    807  IPR000679    Zinc finger, GATA-type
PROSITE profile  PS50114           12.4      749    788  IPR000679    Zinc finger, GATA-type
SuperFamily      SSF57716          1.95E-11  752    792  No hit       No description
Gene3D           G3DSA:3.30.50.10  3.3E-13   754    788  IPR013088    Zinc finger, NHR/GATA-type
CDD              cd00202           1.11E-9   755    789  No hit       No description
Pfam             PF00320           6.6E-14   755    789  IPR000679    Zinc finger, GATA-type
PROSITE pattern  PS00344           0         755    780  IPR000679    Zinc finger, GATA-type
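
The hits in this table cluster around a small number of regions. As a rough illustration, the sketch below merges the overlapping Start and End ranges into contiguous regions. The coordinates are copied from the table; the helper name `merge_intervals` is hypothetical, and merging on simple overlap is an assumption made for this example rather than anything the database itself is known to do.

```python
# (start, end) pairs copied from the Start and End columns of the table above.
HITS = [
    (151, 197), (151, 201), (151, 205), (155, 194), (156, 199), (157, 182), (157, 188),
    (288, 342), (288, 324), (289, 347), (292, 326), (293, 349), (294, 319), (294, 328),
    (511, 561), (511, 547), (515, 549), (516, 573), (516, 563), (517, 542), (517, 551),
    (749, 807), (749, 788), (752, 792), (754, 788), (755, 789), (755, 789), (755, 780),
]


def merge_intervals(intervals):
    """Merge overlapping inclusive intervals into sorted, disjoint regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the current region: extend its end if needed.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(region) for region in merged]


regions = merge_intervals(HITS)
for region_start, region_end in regions:
    print(region_start, region_end)
```

With these inputs the 28 hits collapse into four regions, one around each of the four GATA signature domains.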
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0008270  Molecular Function  zinc ion binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 887 aa
MATAERDKLE LPGLGPRTRR SAATRATGDM AGGGGGGEGE RDGEASLGYI LSLPAASLPL  60
PVAVSCLDAT VPRKARSRLR LRAQPCAWWA FKLPAPAPEE AKSPGSLASV AKNPTEEARS  120
PRPQRLRVRQ APLPDPDPDP ETPVPAKERP AKRARRCLQC GAAETPQWRS GPMGQGTLCN  180
ACGVRLKAAG ALREQVHRPP PATARTVAEP PPESPVSDSS PDGPIWEPGS VPDVYLVRKK  240
PSKQGKPPPP RMEPASAPVP APAPAVYLVK KKKKKAPKTS KKKPWRPRKS AKRCLHCGSS  300
STPQWREGPM GRSTLCNACG VRYRQGRLLP EYRPLASPTF EPSEHANRHS QVLQLHRQRK  360
GQKNQHPLPT EQPQPMDDVD PMNVLLPRRW PNKDEYPPTP LHQPLPQPEL AMTRKGRMLP  420
SLAPLREQGH RSPAALTVSD EPRPEEGPVS ESPPDDCRPI WVLEPGSSEA DLYLVKRKTP  480
KSALPPPAPR TEPAPDPEVY LVKKDKPCLP RMLAKRCVHC ASSSTPQWRE GPKGPSTLCN  540
ACGVRYKQGR LLPEYRPRAS PTFELSVHAN RHTQVLRLHR QQQMKGNNRS RAPPPVEQPQ  600
PTEDGLRVIG NNGDSAGDGQ GSGGDDLMYE LPLPWRWHNK PLPDGMGDSD NAWCPWTPFL  660
SRSEQERRSE SDEQQPQPPG SDPTRTRLSA VAGAGCGMVA SSGGAERERE DGDERLDGLA  720
FAIPRKQRSH PVRAEPSGWL AVKLPVPTPP PVKKCVDCRA TETPQWRAGP EGNHKLCNAC  780
GLRRRKAGER WGEQRGHRQA PVSDQPPLPQ ESLPVLEQQQ PPPPEQPQPA DGLASQLRLG  840
GIDDDSANNA AGASAMDHPM GLDPFLLEGP AAPMIIDPEE TSWTDID
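
The sequence above is printed in blocks of ten residues with a running position counter at the end of each full line. A minimal sketch of turning that layout back into one contiguous string and slicing a region by its 1-based coordinates follows. The helper names `parse_sequence` and `region` are hypothetical, and the parser simply assumes that every letter is a residue and everything else is layout.

```python
import re

# Sequence block copied from above, including spacing and position counters.
RAW = """
MATAERDKLE LPGLGPRTRR SAATRATGDM AGGGGGGEGE RDGEASLGYI LSLPAASLPL  60
PVAVSCLDAT VPRKARSRLR LRAQPCAWWA FKLPAPAPEE AKSPGSLASV AKNPTEEARS  120
PRPQRLRVRQ APLPDPDPDP ETPVPAKERP AKRARRCLQC GAAETPQWRS GPMGQGTLCN  180
ACGVRLKAAG ALREQVHRPP PATARTVAEP PPESPVSDSS PDGPIWEPGS VPDVYLVRKK  240
PSKQGKPPPP RMEPASAPVP APAPAVYLVK KKKKKAPKTS KKKPWRPRKS AKRCLHCGSS  300
STPQWREGPM GRSTLCNACG VRYRQGRLLP EYRPLASPTF EPSEHANRHS QVLQLHRQRK  360
GQKNQHPLPT EQPQPMDDVD PMNVLLPRRW PNKDEYPPTP LHQPLPQPEL AMTRKGRMLP  420
SLAPLREQGH RSPAALTVSD EPRPEEGPVS ESPPDDCRPI WVLEPGSSEA DLYLVKRKTP  480
KSALPPPAPR TEPAPDPEVY LVKKDKPCLP RMLAKRCVHC ASSSTPQWRE GPKGPSTLCN  540
ACGVRYKQGR LLPEYRPRAS PTFELSVHAN RHTQVLRLHR QQQMKGNNRS RAPPPVEQPQ  600
PTEDGLRVIG NNGDSAGDGQ GSGGDDLMYE LPLPWRWHNK PLPDGMGDSD NAWCPWTPFL  660
SRSEQERRSE SDEQQPQPPG SDPTRTRLSA VAGAGCGMVA SSGGAERERE DGDERLDGLA  720
FAIPRKQRSH PVRAEPSGWL AVKLPVPTPP PVKKCVDCRA TETPQWRAGP EGNHKLCNAC  780
GLRRRKAGER WGEQRGHRQA PVSDQPPLPQ ESLPVLEQQQ PPPPEQPQPA DGLASQLRLG  840
GIDDDSANNA AGASAMDHPM GLDPFLLEGP AAPMIIDPEE TSWTDID
"""


def parse_sequence(raw):
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_sequence(RAW)
print(len(protein))               # total residue count
print(region(protein, 157, 188))  # first GATA signature domain
```

Slicing with the Start and End values from the Signature Domain table should reproduce the aligned query segments shown there, which is a quick way to confirm that the domain coordinates are 1-based.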
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    269    283  KKKKKKAPKTSKKKP
2    270    282  KKKKKAPKTSKKK
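
When these coordinates are used to slice the protein, it helps to confirm which numbering convention they follow. The sketch below locates both NLS sequences in residues 241-300 of the protein (copied from the Sequence section) and reports 1-based inclusive positions. The names `SEGMENT`, `SEGMENT_START` and `locate` are hypothetical and exist only for this example.

```python
# Residues 241-300 of the protein, copied from the Sequence section.
SEGMENT_START = 241  # 1-based position of the first residue in SEGMENT
SEGMENT = "".join(
    "PSKQGKPPPP RMEPASAPVP APAPAVYLVK KKKKKAPKTS KKKPWRPRKS AKRCLHCGSS".split()
)

# NLS sequences copied from the table above.
NLS = ["KKKKKKAPKTSKKKP", "KKKKKAPKTSKKK"]


def locate(motif):
    """Return the 1-based inclusive (start, end) of motif, or None if absent."""
    index = SEGMENT.find(motif)
    if index < 0:
        return None
    start = SEGMENT_START + index
    return (start, start + len(motif) - 1)


for motif in NLS:
    print(motif, locate(motif))
```

On the sequence as printed, the two motifs are found at 270-284 and 271-283 when counting from 1, one higher than the Start and End values in the table. This suggests, though the page does not say so, that the NLS coordinates are 0-based while the signature domain coordinates are 1-based.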
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_004977277.1  0.0      PREDICTED: serine/arginine repetitive matrix protein 1-like isoform X1
TrEMBL  K3Y7D9          0.0      K3Y7D9_SETIT; Uncharacterized protein
STRING  Si010130m       0.0      (Setaria italica)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP72732646
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G25830.1  5e-28    GATA transcription factor 12